A Parallel Algorithm for Gene Expressing Data Biclustering
نویسندگان
چکیده
Biclustering of the gene expressing data is an important task in bioinformatics. By clustering the gene expressing data obtained under different experimental conditions, function and regulatory elements of the gene sequence can be analyzed and recognized. A parallel biclustering algorithm for gene expressing data is presented. Based on the anti-monotones property of the quality of the data sets with their sizes, the algorithm starts from the data sets containing of all the 2*2 submatrices of the gene expressing data matrix, and gets the final biclusters by gradually adding columns and rows on the data sets. Experimental results show that our algorithm has superiority over other similar algorithms in terms of processing speedup and quality of clustering and efficiency.
منابع مشابه
An Improved Biclustering Algorithm for Gene Expression Data
Cheng-Church (CC) biclustering algorithm is the popular algorithm for the gene expression data mining at present. Only find one biclustering can be found at one time and the biclustering that overlap each other can hardly be found when using this algorithm. This article puts forward a modified algorithm for the gene expression data mining that uses the middle biclustering result to conduct the ...
متن کاملHigh Performance Parallel/Distributed Biclustering Using Barycenter Heuristic
Biclustering refers to simultaneous clustering of objects and their features. Use of biclustering is gaining momentum in areas such as text mining, gene expression analysis and collaborative filtering. Due to requirements for high performance in large scale data processing applications such as Collaborative filtering in E-commerce systems and large scale genome-wide gene expression analysis in ...
متن کاملSPHier: Scalable Parallel Biclustering Using Weighted Bigraph Crossing Minimization
Biclustering is used for discovering correlations among subsets of attributes with subsets of transactions in a transaction database. It has an extensive set of applications ranging from Gene co-regulation analysis[4], documentkeyword clustering[12] and collaborative filtering for online recommendation systems[13]. In this paper, we propose optimal biclustering problem as maximal crossing numbe...
متن کاملEBIC: an artificial intelligence-based parallel biclustering algorithm for pattern discovery
In this paper a novel biclustering algorithm based on artificial intelligence (AI) is introduced. The method called EBIC aims to detect biologically meaningful, order-preserving patterns in complex data. The proposed algorithm is probably the first one capable of discovering with accuracy exceeding 50% multiple complex patterns in real gene expression datasets. It is also one of the very few bi...
متن کاملAn FPGA-based Geometric Biclustering Accelerator for Genes Microarray Data Analysis
This paper introduces a novel hardware architecture for accelerating geometric biclustering (GBC) algorithm for genes microarray data analysis on FPGA. The proposed FPGAbased accelerator provides high throughput parallel processing capability and improves the speed of GBC computation by 30% compared to purely software implementation written in C language.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JCP
دوره 3 شماره
صفحات -
تاریخ انتشار 2008